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Serine/threonine kinase 39 (STK39) gene has been reported to be a hypertension-susceptibility gene by a 
recent genome-wide association study in Western populations. To validate this finding in Chinese, we 
focused on five well-characterized common polymorphisms in STK39 gene to examine their potential 
association with hypertension in a large northeastern Han population. This is a hospital-based case-control 
study involving 1009 hypertensive patients and 756 normotensive controls. Data were analyzed by the 
Haplo.Stats and multifactor dimensionality reduction (MDR) softwares. The genotype and allele 
distributions of rs6749447, rs3754777 and rs6433027 differed significantly between patients and controls 
(P < 0.001) even after the Bonferroni correction. The majority of derived haplotypes also showed 
remarkable differences between the two groups (P < 0.001). As indicated by MDR analysis, a three-locus 
model including rs6749447, rs35929607 and rs3754777 was selected as the overall best with a larger testing 
accuracy of 0.7309 and a maximum cross-validation consistency of 10 (P < 0.001). The utility of this model 
was reinforced by a Logistic regression analysis. Taken together, our findings suggest the potential 
interactive role of STK39 gene multiple polymorphisms in the development of hypertension among 
northeastern Han Chinese. 



Hypertension is a common complex disease that causes significant morbidity and mortality worldwide. 
Evidence is mounting suggesting that arterial hypertension is a leading risk factor for global disease 
burden with 16.5% of all deaths attributable to high blood pressure (BP) 1 . Also hypertension is a polygenic 
disease with a considerable genetic component 2 . Considering the fact that some individuals are more susceptible 
to hypertension than others 3,4 , it is therefore of timely importance to explain this inter-individual difference in 
disease susceptibility in order to improve the existing therapy and prevention strategies. 

Not until a recent genome-wide association study by Wang et al in Western populations 5 has the potential 
contributory role of serine/threonine kinase 39 (STK39) gene in the pathogenesis of hypertension has drawn 
global attention 6 . More recently, Xi et al performed a meta-analysis of 10 studies on STK39 gene rs3754777 and 
found significant association in Europeans and East Asians, but not in Africans 7 . There exists a common 
drawback for these two studies, that is, the genetic defects of STK39 gene were examined individually without 
allowing for their potential interactions. Such interaction cannot be overlooked because polymorphisms do not 
exist in isolation, and it may be the combination of base changes at several loci that influences gene function 8 '. 

STK39 is a sterile 20-like-related proline-alanine-rich kinase that not only mediates the phosphorylation of 
several cation-chloride-coupled cotransporters to maintain salt-water homeostasis but also interacts with the 
components of p38 MAP kinase pathway to attenuate the injury of cellular stress 1011 . The gene encoding STK39 
consists of 18 exons and spans about 300 kb on chromosome 2q24.3. The genomic sequence of STK39 gene is 
polymorphic and so far 34 polymorphisms have been validated by SNPbrowser™. Several genetic polymorph- 
isms in STK39 gene have been reported to be associated with hypertension; however, the results are often not 
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reproducible 12 " 16 . To generate more information and unveil an epi- 
static picture, we in this study focused on five well- characterized 
common polymorphisms of STK39 gene, aiming to examine their 
individual and interactive association with the risk of hypertension 
among northeastern Han Chinese. 

Methods 

Study population. This was a hospital-based case-control study. Altogether, 1765 
unrelated subjects of Han nationality were enrolled from the local residents of 
Qiqihar city, Heilongjiang province in northeast China. The Ethics Committee of 
Qiqihar Medical University approved this study that was conducted in accordance 
with the guidelines outlined in Declaration of Helsinki. All subjects signed informed 
consent at the time of enrollment. 

All study subjects were assigned to the hypertensive group or normotensive group 
based on clinical examinations and laboratory measurements. Essential hypertension, 
with an unknown etiology, is the cause of 95% of all cases of diagnosed hypertension. 

The hypertensive group consisted of inpatients or outpatients of the Second 
Affiliated Hospital of Qiqihar Medical University. Hypertensive patients with clinical 
evidence of secondary hypertension and renal disease were excluded. The normo- 
tensive controls who underwent a medical examination at the same hospital were 
clinically confirmed to be free of hypertension and had a negative family history of 
hypertension in their first-degree relatives. All study subjects were genetically unre- 
lated local residents of Han Chinese nationality who were recruited consecutively 
between June 2008 and December 2012. 

Diagnosis. Hypertension is defined as a mean systolic BP of at least 140 mm Hg or a 
mean diastolic BP of at least 90 mm Hg or the current intake of antihypertensive 
drugs. BP was measured using a calibrated mercury sphygmomanometer with an 
appropriate adult cuff size by certified examiners. 

As recommended by Tobin et al 17 , for subjects under antihypertensive treatment, 
BP was imputed by respectively adding 15 and 10 mmHg for systolic and diastolic BP, 
and this imputation was also adopted by Newton-Cheh et al 18 . Unless otherwise 
indicated, the imputed systolic and diastolic BP was used in the analysis. 

Sample size. The hypertensive group included 1009 sporadic patients with a mean 
age of 64.48 (standard deviation: 8.53) years. Males comprised 54.31% of the 
hypertensive group. The remaining subjects (n — 756) whose blood pressure was 
normal formed the age-, gender- and ethnicity- matched control group. 

Demographic and clinical measurement. At the time of enrollment, the data on age, 
gender, body weight and height were recorded. Body weight and height were 
measured when subjects in light clothes and bare feet to the nearest 0. 1 kg and 0. 1 cm, 
respectively. Body mass index (BMI) was computed as the weight in kilograms 
divided by height in meters squared. 

Fasting venous blood was drawn from each subject, and the upper sera were 
isolated immediately by centrifugation. The plasma levels of triglycerides, total 
cholesterol, high-density lipoprotein cholesterol (HDL-C), blood urea nitrogen, 
creatinine and urea acid were determined enzymatically using available kits and auto- 
analyzers. High -sensitivity C-reactive protein (hsCRP) was measured using a 
particle -enhanced immunoturbidimetric method. 

Polymorphism selection. With the use of PubMed and Online Mendelian 
Inheritance in Man datasets, five intronic common polymorphisms in STK39 were 
selected, including rs6749447, rsl2692877, rs35929607, rs3754777 and rs6433027. 
These polymorphisms have been reported to be associated with hypertension or BP 
by many studies 5 ' 71619 ' 20 . The genomic organization of human STK39 gene and the 
localization of five examined polymorphisms are provided in Supplementary material 
online, Figure SI. 

Genotyping. Genomic DNA was isolated from peripheral blood leukocytes 
according to a standard phenol-chloroform method and stored at — 40 U C until 
required for batch genotyping. Genotypes of all examined polymorphisms were 
determined by a polymerase chain reaction-ligase detection reaction (PCR-LDR) 
method as previously described 9 . 

The PCR reaction was conducted in an EDC-810 Amplifier {Dongsheng 
Innovation Biotech Co., Ltd., Beijing, China). The cycling parameters were as follows: 
94 Q C for 2 min; 35 cycles of 94°C for 20 s, 60°C for 20 s and 72°C for 20 s; and a final 
extension step at 72°C for 3 min. 

Two specific probes were synthesized for each polymorphism to discriminate 
specific bases with one common probe labeled by 6 -carboxy- fluorescein (FAM) at the 
3' end and by phosphorylated at the 5' end. The multiplex ligation reaction was 
carried out in a reaction volume of 10 ul containing 2 ul of PCR product, 1 ul 10 X 
Taq DNA ligase buffer, 1 uM of each discriminating probe, 5 U Taq DNA ligase with 
the ligation parameters of 30 cycles of 94C for 30 s and 56°C for 3 min. After the 
reaction, 1 ul LDR reaction product was mixed with 1 ul ROX passive reference and 
1 ul loading buffer, and then denatured at 95 "C for 3 min and chilled rapidly in ice 
water. The fluorescent products of LDR were differentiated using the ABI 3730XL 
sequencer {Applied Biosystems, USA). 

To test the accuracy of this PCR-LDR method, 96 DNA samples were randomly 
selected and run in duplicates with 100% concordance. 



Statistical analysis. Statistical analyses were completed with the STATA software 
(version 11.0) for Windows (StataCorp LP, College Station, TX, USA). Study power 
was computed by the Power and Sample Size Calculations (PS) software (v3.0.7). 

Demographic indexes and clinical biomarkers were compared between patients 
and controls by the unpaired t-test or the Mann-Whitney U test or the j_ 2 test where 
appropriate. The deviation from Hardy- Weinberg equilibrium was assessed by a 
Pearson goodness-of-fit test. The genotype and allele distributions of five examined 
polymorphisms between the two groups were compared by the j_ 2 test. 

Each genotype of examined polymorphisms was regressed in a Logistic model, 
assuming the additive (major homozygotes versus heterozygotes versus minor 
homozygotes), the dominant (major homozygotes versus heterozygotes plus minor 
homozygotes) and the recessive (major homozygotes plus heterozygotes versus 
minor homozygotes) models of inheritance with covariates of age, gender and BMI. 
Risk estimates are expressed as the odds ratio (OR) and its 95% confidence interval 
(95% CI). 

The frequencies of haplotypes derived from five examined polymorphisms in 
STK39 gene were estimated by the haplo.em program. This program computes the 
maximum likelihood estimates of haplotype probabilities using the progressive 
insertion algorithm which progressively inserts batches of loci into haplotypes of 
growing lengths. In this study, only haplotype with estimated frequency of greater 
than 1% was considered. P value was simulated on 1000 replicates. In addition, the 
haplo. score program was used to model a subject's intermediate phenotype as a 
function of each inferred haplotype, weighted by their estimated probability, to 
account for the haplotype ambiguity. This program provides the global and haplo- 
type-specific test 21 . The haplo.em and haplo. score programs are implemented in the 
Haplo. Stats software (version 1.4.0) operated in the R language (version 2.14, avail- 
able at the website http://www.r-project.org). 

The open-source data-mining multifactor dimensionality reduction (MDR) 
approach 22,23 (version 3.0, available at the website www.epistasis.org) was employed to 
identify and characterize the interaction of STK39 gene multiple polymorphisms. 
This approach aims to construct all possible combinations of examined poly- 
morphisms and selects the overall best model. The accuracy of each model is eval- 
uated by a Bayes classifier in the context of 10-fold cross validation. A single best 
model simultaneously has the maximum testing accuracy and cross-validation con- 
sistency (a measure of the number of times of 10 divisions of the dataset that the best 
model is extracted). Statistical significance was evaluated using a 1000-fold per- 
mutation test to compare the observed testing accuracy with the expected one under 
the null hypothesis of null association. The permutation test corrects for multiple 
testing by repeating the entire analysis on 1000 datasets that are consistent with the 
null hypothesis. Further to test the utility of MDR analysis, a classical Logistic 
regression analysis was undertaken. 

Furthermore, an interaction entropy graph was depicted to quantify the synergistic 
and non-synergistic interactions between examined polymorphisms. This graph is 
implemented in MDR approach. The information gain value expressed as the per- 
centage in the node signifies the independent main effect of each examined poly- 
morphism. For the information gain value on the connected line, the positive, 
negative and zero values indicate synergism (the red or the orange line), redundancy 
(the green or the blue line) and independence (the yellow line), respectively. 

Results 

Baseline characteristics. Demographic indexes and clinical bio- 
markers are compared between hypertensive patients and normo- 
tensive controls in Table 1. The distribution of age (P = 0.751) and 
gender (P = 0.843) did not differ significantly between the two 
groups. Relative to controls, the levels of BMI (P < 0.001), systolic 
and diastolic BP (both P < 0.001), fasting glucose (P < 0.001), 
triglycerides (P = 0.024), creatinine (P = 0.002) and hsCRP (P < 
0.001) were significantly higher in patients, whereas the HDL-C level 
was significantly lower (P < 0.001). No significant difference was 
observed for the other clinical biomarkers (P > 0.05). The 
proportion of patients undergoing antihypertensive treatment was 
14.17%. 

Single-locus analysis. The genotype distributions and allele frequen- 
cies of five examined polymorphisms in STK39 gene are provided in 
Table 2. There was no deviation from the Hardy- Weinberg 
equilibrium for all polymorphisms (P > 0.05). Overall differences 
were significant for the genotype and allele distributions of 
rs6749447, rs3754777 and rs6433027 between patients and 
controls (P < 0.001 for all) even after the Bonferroni correction 
(Bonferroni significance threshold P = 0.05 divided by the total 
number of 5 examined polymorphisms: P = 0.01). The power to 
reject the null hypothesis of no differences in risk allele frequencies 
of rs6749447, rs3754777 and rs6433027 between the two groups was 
100%, 100% and 98.4%, respectively. 
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Table 1 | The characteristics of the study population 



Characteristics 


Patients (n = 1 009) 


Controls (n = 756) 


p:|: 


nuc lycjui oi 


64 48 + 8 53 


64 23 + 1 0 1 3 


0.75 1 


("^pnripr Imn p^ %1 

WCI IUCI 1 1 1 l\A 1 Co , /O J 


54.31 


53.84 


0.843 


BMI tka/m 2 l 

UJ V II I l\VJ/ 111 1 


27.89 ± 6.29 


23 1 8 + 3 77 


<0.001 


Anti nvnprtprKivp trpntmpnt \/r\ 

iW MM lyUCI ICl IjIVU m cumiici M 1 / O J 


14.17% 


0% 


<0.001 


Adiusted SBP ImmHnl 

/\*wl UolCU wL/l llllllll 1 KJ 1 


147 31 + 16 5? 


1 09 76 + 1 7 97 


<0.001 


ArJinstpri HRP fmmHnl 

AU UilCU !_/ Ul llllllll ILJI 


qo 09 + 1 5 92 


71 37 + 11 43 

/ 1 ,\J f — 1 1 .*+iJ 


<0.001 


Fasting glucose (mmol/L) 


6.14 ± 2.15 


5.33 ±1.12 


<0.001 


TG (mmol/L) 


1 .90 ± 1 .04 


1 .77 ± 0.95 


0.024 


TC (mmol/L) 


4.59 ±1.18 


4.59 ± 0.91 


0.995 


HDL-C (mmol/L) 


1.12 ±0.32 


1.24 ±0.34 


<0.001 


BUN (mmol/L) 


5.92 ± 3.88 


5.79 ± 4.35 


0.557 


Creatinine (u.mol/L) 


87.47 ± 36.79 


81.80 ± 25.13 


0.002 


Uric acid (u.mol/L) 


329.12 ± 100.31 


333.98 ± 95.54 


0.374 


hsCRP (mmol/L) 


12.37 ±41.42 


2.21 ± 3.71 


0.001 



Abbreviations: BMI, body mass index; SBP, systolic blood pressure; DBP, diastolic blood pressure; TC, total cholesterol; TG, triglycerides; HDL-C, high-density lipoprotein cholesterol; BUN, blood urea 
nitrogen; hsCRP, high sensitivity C-reactive protein. 

*P values were computed by the unpaired t-test or the Mann-Whitney U test for quantitative variables and by the x 2 test for qualitative variables. 



Table 3 summarizes the risk prediction of five examined poly- 
morphisms in STK39 gene for hypertension under three genetic 
models of inheritance. Compared with the additive model, the risk 
estimates of rs6749447 and rs3754777 were slightly increased under 
the dominant model, whereas this increase was obvious under the 
recessive model. Taking rs3754777 as an example, the odds of having 
hypertension was 3.44 (P < 0.001), 3.94 (P < 0.001) and 6.23 (P < 
0.001) under the additive, dominant and recessive models, respect- 
ively. It is worth noting that this significant association was inde- 
pendent of the confounding factors including age, gender and BMI. 
In contrast to rs6433027, the crude highest risk estimate was seen 
under the dominant model (OR = 1.82; 95% CI: 1.45-2.28; P < 
0.001) but attenuated after adjusting for the confounders (OR = 
1.63; 95% CI: 1.26-2.10; P < 0.001). 

Haplotype analysis. The haplotype frequencies of five examined 
polymorphisms in STK39 gene are compared between patients and 
controls in Table 4. To improve the power to detect an association, 
only haplotype with frequency of at least 1% in overall subjects was 



analyzed. All derived haplotypes accounted for 96.77%, 95.18% and 
97.49% of the cumulative variance in overall subjects, patients only 
and controls only, respectively. 

The frequencies of most derived haplotypes differed significantly 
(P £ 0.001) between the two groups, except for three low-penetrance 
haplotypes C-C-G-G-C (alleles in order of rs6749447, rsl2692877, 
rs35929607, rs3754777 and rs6433027) (P = 0.115), A-G-A-G-C (P 
= 0.393) and C-C-G-G-T (P = 0.460) even after the Bonferroni 
correction (Bonferroni significance threshold P = 0.05 divided by 
the total number of comparisons (n = 17): P = 0.0029). 

Haplotype-phenotype analysis. When treating all derived 
haplotypes as a whole, the omnibus association with demographic 
indexes and clinical biomarkers is presented in Table 5. In 
hypertensive patients, the marginally significant association was 
observed for SBP (global simulated P = 0.019), HDL-C (global 
simulated P = 0.017) and BUN (global simulated P = 0.047). 
There was no observable significance between haplotypes and 
phenotypes in controls. 



Table 2 | Genotype distributions and allele frequencies of five examined polymorphisms in STK39 gene between hypertensive patients and 
normotensive controls 



Polymorphism* 




Patients (n = 1009) 


Controls (n = 756) 


X 2 




rs6749447 


CC 


488 (48.36) 


566 (74.87) 








CA 


435 (43.1 1) 


175 (23.15) 


132.97 


<0.001 




AA 


86 (8.52) 


15 (1.98) 








A allele 


30.08% 


13.56% 


133.20 


<0.001 


rs 12692877 


GG 


710 (70.37) 


531 (70.24) 








GC 


258 (25.57) 


208 (27.51) 


4.95 


0.084 




CC 


41 (4.06) 


17(2.25) 








C allele 


16.85% 


16.01% 


0.45 


0.504 


rs35929607 


AA 


309 (30.26) 


236 (31.22) 








AG 


499 (49.45) 


377 (49.87) 


0.29 


0.866 




GG 


201 (19.92) 


143 (18.92) 








G allele 


44.65% 


43.85% 


0.22 


0.636 


rs3 754777 


GG 


562 (55.70) 


629 (83.20) 








GA 


384 (38.06) 


1 19 (15.74) 


152.86 


<0.001 




AA 


63 (6.24) 


8 (1.06) 








A allele 


25.27% 


8.93% 


154.62 


<0.001 


rs6433027 


CC 


178 (17.64) 


212 (28.04) 








CT 


490 (48.56) 


358 (47.35) 


33.52 


<0.001 




TT 


341 (33.80) 


186 (24.60) 








T allele 


58.08% 


48.28% 


33.39 


<0.001 



*Genotypes of five examined polymorphisms were expressed as count (percentage). 

**P values were computed by using the % 2 test based on the 3 X 2 contingency tables for the genotype comparison and on the 2 X 2 contingency tables for the allele comparison. 
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Table 3 | Individual risk prediction of five polymorphisms examined in STK39 gene for the occurrence of hypertension under the additive, 
dominant and recessive models of inheritance 



1 1 ■ 
Polymorph ism- 




.... 

Additive model 


• 

Dominant model 


Recessive model 


(wild/mutant allele) 


.. . 
Adjusted* 


(mm vs. Wm vs. WW) 


(mm plus Wm vs. WW) 


(mm vs. Wm plus WW) 


rs6749447-C/A 


No 


2.78; 2.32-3.33; <0.001 


3. 18; 2.59-3.91 ;<0.001 


4.60; 2.64-8.03; <0.001 




Yes 


2.41; 1.98-2.93; <0.001 


2.71; 2.16-3.40; <0.001 


3.91; 2.1 8-7.01 ;<0.001 


rsl 2692877-G/C 


No 


1 .06; 0.89-1 .27; 0.5 1 3 


0.99; 0.81-1 .22; 0.953 


1 .84; 1 .04-3.27; 0.037 




Yes 


1.11; 0.9 1 -1 .35; 0.3 1 2 


1.08; 0.86-1 .36; 0.502 


1.54;0.81-2.92;0.185 


rs35929607-A/G 


No 


1.03; 0.90-1. 18; 0.635 


1.03; 0.84-1 .26; 0.790 


1.07; 0.84-1.35; 0.598 




Yes 


0.98; 0.84-1. 14; 0.802 


1.01; 0.81-1 .27; 0.919 


0.93; 0.71-1 .21; 0.574 


rs3754777-G/A 


No 


3.44; 2.79-4.24; <0.001 


3.94; 3.14-4.95; <0.001 


6.23; 2.97-1 3.07; <0.001 




Yes 


3.78; 2.99-4.77; <0.001 


4.43; 3.42-5.74; <0.001 


7.41; 3.39-1 6. 17; <0.001 


rs6433027-C/T 


No 


1.47; 1.29-1.68; <0.001 


1.82; 1.45-2.28; <0.001 


1.56; 1.27-1.93; <0.001 




Yes 


1.44; 1.24-1. 67; <0.001 


1.63; 1 .26-2.10; <0.001 


1.64; 1.30-2.08; <0.001 



Abbreviations: m, the mutant allele; W, the wild allele. Data were expressed as odds ratio; 95% confidence interval; P for three genetic models of inheritance. *Risk estimates were calculated with or without 
adjusting for age, gender and body mass index. 



Interaction analysis. In view of the significant findings in the 
haplotype analysis, it is of great interest to explore the potential 
interaction of five examined polymorphisms in STK39 gene. To 
achieve this goal, a promising data-mining analytical approach 
MDR was employed (Table 6). Each best model across all possible 
combinations is assessed by the testing accuracy, cross-validation 
consistency and significance level. 

Compared with the other models, a four-locus model incorporat- 
ing rs6749447, rs35929607, rs3754777 and rs6433027 was the best 
with the maximum testing accuracy of 0.7329 and cross-validation 
consistency of 10 out of 10. However, the best three-locus model 
including rs6749447, rs35929607 and rs3754777 also had a max- 
imum cross-validation consistency of 10 and higher testing accuracy 
of 0.7309, secondary to the best four-locus model. The best three- and 
four-locus models were both significant at P < 0.001, indicating that 
a model this good or better was observed less than 1 out of 1000 
permutations and thus unlikely hinged on the null hypothesis of null 
association. In view of this subtle difference, to pinpoint the poly- 
morphisms that are of particular interest, the three-locus model was 
regarded as the overall best MDR model in this study. 

Additionally to test the utility of MDR analysis, a classical Logistic 
model was conducted with the outcome variable of hypertension 
status, predictor variable of the product of three polymorphisms 
(rs6749447, rs35929607, rs3754777) in the best three-locus model, 



and covariates of age, gender and BMI. Consistently, the interaction 
of these three polymorphisms was found to be significantly assoc- 
iated with a 1.44-fold (95% CI: 1.36-1.53; P < 0.001) increased risk of 
hypertension under the additive model. After adjusting for the con- 
founding factors, this interaction was associated with a 1.40-fold 
increased risk (95% CI: 1.32-1.49; P < 0.001). 

Interaction graph. Supplementary Figure S2 depicts the interaction 
graph of five examined polymorphisms in STK39 gene. The inde- 
pendent main effect was largest for rs3754777 with the information 
gain value of 6.65%, whereas this largest effect was remarkably 
attenuated by interacting with the other polymorphisms. Notably, 
the interaction of rs3754777 with rs6749447 was antagonistic with 
the information gain value of 2.12%. What's more, the interaction of 
rs6749447 with rs6433027 resulted in a negative information gain 
value of -1.25%. 

Discussion 

With a relatively large study population from northeast China, we 
examined the association of five well-characterized common poly- 
morphisms in STK39 gene with the risk of hypertension, and found 
that three of them exhibited independent significant association in 
the single-locus analysis. To the authors' knowledge, this is the first 
report assessing the interactive contribution of STK39 gene multiple 



Table 4 | Frequencies of derived haplotypes (> 1 % in total subjects) from five examined polymorphisms in STK39 gene between hypertensive 
patients and normotensive controls 



Haplotype (%)* 


Total (n = 1765) 


Patients (n = 1009) 


Controls (n = 756) 


Simulated P 


C-GA-G-T 


13.18 


15.68 


1 1.70 


<0.001 


C-G-G-G-T 


12.27 


14.59 


10.98 


0.001 


C-GA-G-C 


12.14 


20.67 


5.34 


<0.001 


C-G-G-G-C 


1 1.45 


14.38 


8.90 


<0.001 


A-G-G-G-T 


8.36 


3.13 


1 1.70 


<0.001 


CGA-AC 


8.07 


1.78 


13.03 


<0.001 


GGAA-T 


6.23 


2.76 


8.21 


<0.001 


A-G-G-G-C 


4.31 


3.14 


5.45 


<0.001 


A-G-A-G-T 


4.08 


2.55 


5.38 


<0.001 


C-C-A-G-T 


3.49 


4.76 


2.97 


0.001 


C-C-G-G-C 


2.64 


2.70 


2.74 


0.1 15 


A-G-A-G-C 


2.38 


2.33 


2.47 


0.393 


C-C-A-G-C 


1.97 


3.60 


0.33 


<0.001 


A-C-G-G-T 


1.80 


0.73 


2.57 


<0.001 


C-C-G-G-T 


1.68 


1.57 


1.69 


0.460 


C-C-A-A-C 


1.40 


0.27 


2.45 


<0.001 


C-C-A-A-T 


1.32 


0.54 


1.58 


<0.001 



•Alleles in a haplotype were in order of rs6749447, rsl 2692877, rs35929607, rs3754777 and rs6433027 in STK39 gene. 
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Table 5 Global testing of 


all derived haplotypes 


with anthro- 


pometric indexes 


and clinical biomarkers 


in hypertensive patients 


and normotensive 


controls 










Patients 


Controls 


Characteristics Global-stat 


Global P sim 


Global-stat 


Global P Sim 


Age 


27.15 


0. 1 46 


3 1 .60 


0.283 


Gender 


8.58 


0.793 


31 .08 


0.198 


BMI 


36.07 


0.051 


41.24 


0.125 


Adjusted SBP 


56.12 


0.019 


28.86 


0.308 


Adjusted DBP 


21.89 


0.319 


21.38 


0.562 


Triglyceride 


30.50 


0.102 


9.40 


0.999 


TC 


32.81 


0.074 


27.63 


0.321 


HDL-C 


61.79 


0.017 


1 1.47 


0.817 


Glucose 


15.64 


0.167 


14.85 


0.196 


BUN 


27.89 


0.047 


5.79 


0.713 


Creatinine 


9.09 


0.606 


6.35 


0.836 


Uric acid 


20.68 


0.363 


8.69 


0.839 


hsCRP 


5.99 


0.705 


7.47 


0.719 


Abbreviations: BMI, body 


nass index; SBP, systolic blood pressure, DBP, diastolic blood pressure; 


TC, total cholesterol; HDL-C, high-density lipoprotein cholesterol; BUN, blooc 


urea nitrogen; 


hsCRP, high sensitivity C-reactive protein 









polymorphisms in susceptibility to hypertension. These findings not 
only deepen our understanding that complex genetic interactions 
account for disease risk but also provide evidence supporting the 
logical functional role of STK39 in the pathogenesis of hypertension. 

Recently, we have genotyped four top polymorphisms in STK39 
gene with genome- wide significance among 1108 southern Han 
Chinese from Shanghai. Unfortunately none of these polymorph- 
isms were successfully replicated for hypertension risk 16 . Moreover, 
another replication in a British Caucasian cohort also failed to detect 
any relationship between these polymorphisms and BP variation 19 . 
However, in a recent meta-analysis by Xi et al, STK39 gene rs3754777 
was observed to be significantly associated with hypertension risk in 
Europeans and East Asians 7 , in agreement with the results of this 
study among northeastern Han Chinese. Besides, we additionally 
identified another two polymorphisms in STK39 gene that individu- 
ally showed independent association with hypertension even after 
the stringent Bonferroni correction. A possible explanation is genetic 
heterogeneity across races and ethnicities. For example, the fre- 
quency of rs6749447 minor A allele was 23% in this study popu- 
lation, which was slightly lower than that (28%) observed in a 
Caucasian cohort from HapMap dataset. Contrastingly, the iden- 
tification in a South African cohort yielded an exceedingly high 
frequency at 53% 19 . It is not uncommon to encounter genetic het- 
erogeneity in any disease identification strategy 24 , which can be 
somewhat avoided by studying homogeneous populations 25 . In fact, 
our study population is characterized by genetic homogeneity and 
geographic stability. Moreover, the subjects are most likely uniform 
in terms of environmental exposures, including habitual dietary 
intake of high amount of salt and fat 26 and a lower rate of hyperten- 
sion recognition and treatment. These factors render this population 
more appropriate for enhancing the understanding of genetic pre- 



disposition to hypertension. Furthermore, it cannot be fully ruled out 
that the evolutionary history of linkage disequilibrium patterns var- 
ies significantly across different ethnic populations. Usually a locus is 
in close linkage with another nearby causal locus in one ethnic group 
but not in another 27 . As a consequence, there is a need to construct a 
database of hypertension-susceptibility genes or polymorphisms in 
each racial/ethnic group. 

Another possibility for these inconsistent findings is the potential 
modifiable impact of environmental or lifestyle factors on genetic 
predisposition. Genetic factors may increase an individual's vulner- 
ability to develop a disease; however its occurrence largely depends 
on the exposure to a certain environment or lifestyle. For example, a 
high-salt diet is known to induce or aggravate the development of 
hypertension in both animal models and human beings 28,29 . In China, 
the amount of salt consumption differs remarkably between north- 
ern and southern Chinese. According to a multi-center observation 
in China, the average urinary sodium excretion in northern Chinese 
(271 mmol per day) is nearly double the amount of that in southern 
Chinese (139 mmol per day), leading to average 7.4 and 6.9 mmHg 
increases of systolic and diastolic BP, respectively 30 . In view of the 
divergent STK39-hypertension association between northeastern 
(the present study) and southern Chinese 16 , it seems reasonable to 
speculate that the high-salt intake may modify an individual's genetic 
predisposition to hypertension. In fact, STK39 encodes a protein 
kinase more commonly known as SPAK, which participates in a 
signaling pathway that activates thiazide-sensitive NaCl cotransport 
in the kidney. An emerging body of evidence demonstrates that 
genetic defects that increase STK39 expression can alter renal salt 
handling and the BP set point 531,32 . The exact mechanism underlying 
the significant STK39-hypertension association in this study is so far 
unclear, and thus if involved, these intronic polymorphisms might be 
in linkage disequilibrium with other flanking functional loci on 
STK39 gene that affect the final bioavailability of STK39. 

Extending the previous observations, our findings revealed strong 
evidence that STK39 gene multiple polymorphisms may act inter- 
actively to increase hypertension risk. It is generally believed that the 
relative risk attributable to a single locus is small. To make up for this 
flaw, we first applied a haplotype analysis by pooling all examined 
polymorphisms together. Haplotype is composed of different alleles, 
and so haplotype analysis could be more informative regarding the 
effect of genetic interaction on phenotype than an analysis with a 
single polymorphic marker 33 . The possible reason of this effect may 
owe to the synergistic action between rs6749447-C and rs3754777-G 
alleles, as the haplotypes simultaneously possessing these two alleles 
were overrepresented in hypertensive patients, albeit their protective 
tendency in a single-locus analysis, or to the antagonist action 
between rs6749447-A and rs6433027-T alleles, as the haplotypes 
with simultaneous presence of these two alleles were frequently 
observed in normotensive controls. In view of the complex allelic 
interaction in a haplotype, we then adopted a promising data- mining 
approach MDR. This approach is nonparametric and model-free in 
design and has been successfully applied to detect and characterize 
high-order gene-gene and gene-environment interactions in studies 
with relatively small samples 3,34 . By using MDR approach, we iden- 
tified a significant interaction between rs6749447, rs35929607 and 



Table 6 Summary of MDR analysis 


Best combination of each model 


Testing accuracy 


Cross-validation consistency 


P 


rs3 754777 


0.6130 


7 


0.0017 


rs6749447, rs3 754777 


0.7150 


10 


<0.0001 


rs6749447, rs35929607, rs3754777 


0.7309 


10 


<0.0001* 


rs6749447, rs35929607, rs3754777, rs6433027 


0.7329 


10 


<0.0001 


rs6749447, rs 12692877, rs35929607, rs3754777, rs6433027 


0.7303 


10 


<0.0001 


*The overall best MDR model. 
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rs3754777. Further in interaction entropy investigation, despite the 
predominant role of rs6749447 and rs3754777, their joint effect was 
strongly attenuated as reflected by the information gain value. 
Moreover, the predictive value of re6437027 was totally offset by 
rs6749447 with negative entropy value. These interactions further 
reinforced the results of our haplotype analysis. Although the inde- 
pendent main effect of rs3 5929607 was extremely low, its potential 
role was fully embodied by interacting with rs3754777. Considering 
the ubiquity of epistasis in determining hypertension susceptibility, it 
is a high priority to examine the interaction of more candidate genes 
or pathways. 

This study has several possible limitations. First, the cross-sec- 
tional nature of our study did not allow us to make inference about 
the causality for the effects 35 . Second, this study of 1765 subjects may 
be underpowered to discern a valid, medically important effect. 
However, a sample size calculation ensured us with more than 98% 
statistical power to detect significance. Third, this study had a limited 
coverage of genetic markers in STK39 gene, and it is highly encour- 
aged to incorporate other polymorphisms, especially some low- 
penetrance mutations or other logical hypertension-susceptibility 
genes, such as the genes encoding CaMK4 36 and GRKs 37-38 . Fourth, 
due to our design flaw, some clinical and lifestyle details on the study 
population were not available, as well as other reproducible risk 
factors for hypertension, which prevented further adjustment in risk 
estimates and may have overestimated the true effect size. Fifth, the 
MDR approach used in this study has some underlying drawbacks, 
including computational intensiveness, indistinct interpretation, 
lack of sensitivity and heterogeneity-free assumption 39,40 . Last but 
not the least, the fact that our study subjects were of northeastern 
Han Chinese descent limits the generalizability of our findings and 
necessitates further confirmation in other ethnic populations. 

Despite these limitations, our findings suggest the potential inter- 
active role of STK39 gene multiple polymorphisms in the develop- 
ment of hypertension among northeastern Han Chinese. As the 
development of hypertension is believed to be largely under genetic 
control, more emphases should be placed on the detection and char- 
acterization of multiple genetic interactions to predict high-risk indi- 
viduals for prevention and personalized treatment of hypertension. 
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